Using the maximum-entropy method, we calculate the end-to-end distance distribution of a
force-stretched chain from the moments of the distribution, which can be obtained from the
extension-force curves recorded in single-molecule experiments. If one knows the force expansion of
the extension through the (n - 1)th power of the force, this is enough information to calculate the
first n moments of the distribution. We test the method on three force-stretched chain models: the
Gaussian chain, the freely jointed chain, and the excluded-volume chain on a two-dimensional lattice.
The method reconstructs all distributions precisely. We also apply the method to force-stretched
complex chain molecules: the hairpin and secondary structure conformations. We find that the
distributions of homogeneous chains of the two conformations are very different: there are two
independent peaks in the hairpin distribution, while only one peak is observed in the distribution of
secondary structure conformations. Our discussion also shows that the end-to-end distance
distribution may reveal more critical physical information than the simpler extension-force curves
can give.






I. INTRODUCTION 



Recent advances in single-molecule manipulation have made it possible to measure and characterize
molecular properties at the single-molecule level. One of the basic characteristics is the
extension-force curve (EFC). These curves have provided a great deal of interesting and useful
physical information about the studied molecules, ranging from detailed elastic properties to complex
structural transitions. On the theoretical side, many models have been constructed to characterize
and explain the various EFCs recorded for different molecules. Apart from computer simulations, e.g.,
molecular dynamics or Monte Carlo sampling, the calculation of the end-to-end distance distributions
(EEDDs) of force-stretched molecules is the central problem in the statistical mechanical approach.
In principle, EEDDs can be obtained from the partition function. Two questions must be faced first,
however: which physical interactions should be taken into account, and which mathematical technique
is needed to solve for the EEDDs. It is not easy to describe the physical interactions in complex
molecules such as polyelectrolytes or proteins, and a mathematical technique suited to one molecular
system is not always useful in another.

In contrast to the traditional approach, in this paper we try to extract EEDDs from experimentally
recorded EFCs using the maximum-entropy (or least-biased) method (MEM). Our motivations are as
follows. First, to our knowledge, little attention has been paid to EEDDs in previous force-stretching
models. Although the EEDDs of simple molecules may be simple enough, there is no reason to assume
that they remain simple for complex molecules such as RNA with secondary structure. Second, because
the EEDDs result from the interplay between intramolecular interactions and the applied force, they
can serve as a preliminary test for more realistic physical models. In addition, our study shows that
the EEDD at vanishing force calculated by MEM keeps almost all characteristics of the exact
distribution without force. This method may therefore provide a way to directly "measure" EEDDs by
force spectroscopy.

The organization of this paper is as follows. In Sec. II we briefly review the maximum-entropy
method. In Sec. III the basic relations between the distance moments and the EFCs are derived. In
Sec. IV MEM is tested by reconstructing the EEDDs of three force-stretched chain models: the Gaussian
chain, the freely jointed chain, and the self-avoiding chain on a two-dimensional lattice. In Sec. V
we show that the method is capable of resolving the EEDDs of complex chain molecules stretched by
force. As an illustration, a model of force unzipping of double-stranded chain molecules is used to
provide exact EFCs and EEDDs [12]; these results are needed to calculate the EEDDs by MEM and to
compare them with the exact ones. Section VI presents our conclusions.



II. MAXIMUM ENTROPY METHOD



Constructing a distribution function from a finite set of its moments is an old mathematical
problem, and MEM has proved useful for it [13]. For a normalized distribution function $P(z)$ on the
interval (0, 1), the power moments are

$$\mu_n = \int_0^1 z^n P(z)\,dz. \tag{1}$$



On the other hand, given a set of $(M+1)$ moments, from $\mu_0$ to $\mu_M$, it is not always possible
to find a positive, well-behaved function $P(z)$ that has these moments. The necessary and sufficient
conditions for the existence of a function $P(z)$ with a given set of $(M+1)$ moments on the interval
(0, 1) are the Hausdorff relations [13]:

$$\sum_{m=0}^{k} (-1)^m \binom{k}{m}\, \mu_{n+m} \ge 0, \qquad (n, k) = (0, 0)\ \text{to}\ n + k \le M. \tag{2}$$
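
The relations in Eq. (2) are easy to test numerically before any fitting is attempted. The following
Python sketch is only an illustration: the function name `hausdorff_ok` and the tolerance argument
are assumptions of this example, not part of the original method. A failed check flags a moment set
(for example, one obtained from noisy numerical derivatives) that cannot belong to any non-negative
distribution on (0, 1).

```python
from math import comb

def hausdorff_ok(mu, tol=0.0):
    """Check the inequalities of Eq. (2) for every (n, k) with n + k <= M.

    mu  : sequence of moments mu[0..M] on the interval (0, 1)
    tol : hypothetical slack that allows for rounding or measurement noise
    """
    M = len(mu) - 1
    for k in range(M + 1):
        for n in range(M - k + 1):
            s = sum((-1) ** m * comb(k, m) * mu[n + m] for m in range(k + 1))
            if s < -tol:
                return False
    return True
```

For instance, the moments of the uniform distribution, `[1, 1/2, 1/3, 1/4]`, pass the check, whereas
`[1, 0.5, 0.6]` fails because it would require $\mu_1 - \mu_2 < 0$.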

MEM offers a definite procedure for constructing an approximate distribution $P_M(z)$ from $(M+1)$
moments, in the form [13]

$$P_M(z) = \exp\left(-\sum_{n=0}^{M} \lambda_n z^n\right). \tag{3}$$

The $\lambda_n$ are a set of $(M+1)$ constants determined by the $(M+1)$ known $\mu_n$. Finding them
involves a straightforward nonlinear iterative procedure that usually converges rapidly [13].
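
As an illustration of such an iterative procedure, the following Python sketch determines the
$\lambda_n$ by a damped Newton iteration on the convex function
$\Gamma(\lambda) = \sum_n \lambda_n \mu_n + \int_0^1 \exp(-\sum_n \lambda_n z^n)\,dz$, whose stationary
point reproduces the given moments. This is one common way to solve the problem and is not
necessarily the algorithm of Ref. [13]; the function name `maxent_density`, the Gauss-Legendre
quadrature, and all tolerances are assumptions made for this example.

```python
import numpy as np

def maxent_density(mu, n_quad=200, tol=1e-10, max_iter=200):
    """Fit P_M(z) = exp(-sum_n lam[n] * z**n) on (0, 1) to the moments mu[0..M]."""
    mu = np.asarray(mu, dtype=float)
    M = len(mu) - 1
    # Gauss-Legendre nodes and weights, mapped from (-1, 1) to (0, 1).
    x, w = np.polynomial.legendre.leggauss(n_quad)
    z, w = 0.5 * (x + 1.0), 0.5 * w
    powers = z[None, :] ** np.arange(2 * M + 1)[:, None]  # row k holds z**k

    def dual(lam):
        # Convex dual function; its gradient vanishes when all moments match.
        with np.errstate(over="ignore", invalid="ignore"):
            p = np.exp(-lam @ powers[: M + 1])
            return lam @ mu + w @ p, p

    lam = np.zeros(M + 1)
    val, p = dual(lam)
    idx = np.arange(M + 1)
    for _ in range(max_iter):
        mom = powers @ (w * p)                    # moments 0..2M of the trial density
        grad = mu - mom[: M + 1]
        if np.max(np.abs(grad)) < tol:
            break
        hess = mom[idx[:, None] + idx[None, :]]   # Hankel matrix of trial moments
        step = np.linalg.solve(hess, -grad)
        t = 1.0
        while True:                               # backtracking (Armijo) line search
            new_val, new_p = dual(lam + t * step)
            if new_val <= val + 1e-4 * t * (grad @ step) or t < 1e-12:
                break
            t *= 0.5
        lam, val, p = lam + t * step, new_val, new_p

    def density(zz):
        zz = np.asarray(zz, dtype=float)
        return np.exp(-sum(coef * zz ** n for n, coef in enumerate(lam)))

    return lam, density
```

Calling `maxent_density([1, mu1, mu2])` gives the three-moment approximation and
`maxent_density([1, mu1, mu2, mu3, mu4])` the five-moment approximation. The monomial basis becomes
ill conditioned as $M$ grows, so this sketch is meant for the small values of $M$ used in this paper.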

In general, a real distribution is not defined on the interval (0, 1), so the first step in using MEM
is to map the distribution onto this interval. Let the power moments of the original distribution
$f(x)$ be $\gamma_m$, and let its lower and upper bounds be $a_1$ and $a_2$, respectively. Define the
extent of the distribution as

$$L = a_2 - a_1. \tag{4}$$

First, shift the moments $\gamma_m$ to the interval $(0, L)$ by

$$\bar{\mu}_n = \sum_{m=0}^{n} \binom{n}{m} (-a_1)^{n-m}\, \gamma_m. \tag{5}$$

Then scale the moments $\bar{\mu}_n$ to the interval (0, 1) by

$$\mu_n = \bar{\mu}_n / L^n. \tag{6}$$

MEM can then be used to calculate the distribution $P(z)$.

Conversely, once the approximate distribution $P(z)$ has been solved from the scaled moments, it is
first rescaled from the interval (0, 1) to $(0, L)$ by the change of variable $y = Lz$,

$$g(y) = \frac{1}{L}\, P\!\left(\frac{y}{L}\right), \qquad y \in (0, L). \tag{7}$$

Then the distribution $g(y)$ is shifted to the interval $(a_1, a_2)$ by

$$f(x) = g(x - a_1), \qquad x \in (a_1, a_2). \tag{8}$$
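
The change of interval can be sketched in a few lines of Python. The helper names
`to_unit_interval` and `from_unit_interval` are hypothetical; the first applies the shift and scaling
of the moments, and the second maps a density on (0, 1) back to $(a_1, a_2)$ as in Eqs. (7) and (8).

```python
from math import comb

def to_unit_interval(gamma, a1, a2):
    """Map raw moments gamma[0..M] on (a1, a2) to moments on (0, 1)."""
    L = a2 - a1
    # Shift: moments of y = x - a1, by the binomial expansion of (x - a1)**n.
    shifted = [
        sum(comb(n, m) * (-a1) ** (n - m) * gamma[m] for m in range(n + 1))
        for n in range(len(gamma))
    ]
    # Scale: moments of z = y / L.
    return [s / L ** n for n, s in enumerate(shifted)]

def from_unit_interval(P, a1, a2):
    """Return f(x) on (a1, a2) from a density P(z) on (0, 1)."""
    L = a2 - a1
    return lambda x: P((x - a1) / L) / L
```

As a check, a uniform distribution on (-1, 3) has raw moments 1, 1, and 7/3; the mapping returns 1,
1/2, and 1/3, the moments of the uniform distribution on (0, 1).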



III. MOMENTS FROM EXTENSION-FORCE CURVES 



Assume that one end of a chain consisting of $N$ links is fixed at the origin and that an external
force $f\mathbf{z}_0$ is exerted on the other end, where the unit vector $\mathbf{z}_0$ is along the
$z$ axis. Let $P_N(\mathbf{R}, f)$ be the probability distribution function of the end-to-end vector
$\mathbf{R} = (R_x, R_y, R_z)$ of the force-stretched chain. The power moments of the distribution
$P_N(R_z, f)$ of the component $R_z$ are then

$$\overline{R_z^m}(f) = \langle (\mathbf{R}\cdot\mathbf{z}_0)^m \rangle
= \int dR_z\, R_z^m\, P_N(R_z, f)
= \int d^3R\, (\mathbf{R}\cdot\mathbf{z}_0)^m\, P_N(\mathbf{R}, f). \tag{9}$$

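To make these expressions rigorous and explicit, we use the distribution function of ideal chains,

$$P_N(\mathbf{R}, f) = Q^{-1}[f] \int \mathcal{D}[\mathbf{r}(s)]\,
\delta^3\!\left(\mathbf{R} - \int_0^L ds\, \mathbf{v}(s)\right)
\exp\left[-\frac{1}{k_BT}\int_0^L ds\, \rho_e(\mathbf{r}, s)
+ \frac{f}{k_BT}\, \mathbf{z}_0 \cdot \int_0^L ds\, \mathbf{v}(s)\right], \tag{10}$$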

where $k_B$ is the Boltzmann constant, $T$ is the temperature, $L$ is the arclength of the chain, and
the "vector" $\mathbf{r}(s)$ describes the local state at arclength point $s$: for a flexible Gaussian
chain, $\mathbf{r}$ is a three-dimensional position vector, while for a wormlike chain, $\mathbf{r}$
is the unit tangent vector. The vector $\mathbf{v}(s)$ also depends on the chain model:
$\mathbf{v} = d\mathbf{r}/ds$ for the Gaussian chain, and $\mathbf{v} = \mathbf{r}(s)$, the tangent
vector, for the wormlike chain. The normalization factor $Q[f]$ is



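$$Q[f] = \int d^3R \int \mathcal{D}[\mathbf{r}(s)]\,
\delta^3\!\left(\mathbf{R} - \int_0^L ds\, \mathbf{v}(s)\right)
\exp\left[-\frac{1}{k_BT}\int_0^L ds\, \rho_e(\mathbf{r}, s)
+ \frac{f}{k_BT}\, \mathbf{z}_0 \cdot \int_0^L ds\, \mathbf{v}(s)\right]$$

$$\phantom{Q[f]} = \int \mathcal{D}[\mathbf{r}(s)]\,
\exp\left[-\frac{1}{k_BT}\int_0^L ds\, \rho_e(\mathbf{r}, s)
+ \frac{f}{k_BT}\, \mathbf{z}_0 \cdot \int_0^L ds\, \mathbf{v}(s)\right]. \tag{11}$$

Substituting Eq. (10) into Eq. (9) and performing the $\mathbf{R}$ integral, we have

$$\overline{R_z^m}(f) = Q^{-1}[f] \int \mathcal{D}[\mathbf{r}(s)]
\left(\mathbf{z}_0 \cdot \int_0^L ds\, \mathbf{v}(s)\right)^m
\exp\left[-\frac{1}{k_BT}\int_0^L ds\, \rho_e(\mathbf{r}, s)
+ \frac{f}{k_BT}\, \mathbf{z}_0 \cdot \int_0^L ds\, \mathbf{v}(s)\right]. \tag{12}$$

It is easy to show that Eq. (12) can be rewritten as

$$\overline{R_z^m}(f) = \frac{(k_BT)^m}{Q[f]}\, \frac{\partial^m Q[f]}{\partial f^m}. \tag{13}$$

The first moment is just the average extension $Z(f)$ recorded in experiments at a given force $f$,

$$\overline{R_z^1} = Z(f) = \frac{k_BT}{Q[f]}\, \frac{\partial Q[f]}{\partial f}. \tag{14}$$

We can alternatively relate the partition function $Q[f]$ to derivatives of $Z(f)$ with respect to $f$
as follows:

$$\begin{aligned}
\frac{1}{k_BT}\frac{\partial}{\partial f} Z(f) &= Q_2 - Q_1^2, \\
\frac{1}{k_BT}\frac{\partial^2}{\partial f^2} Z(f) &= 2Q_1^3 - 3Q_1 Q_2 + Q_3, \\
\frac{1}{k_BT}\frac{\partial^3}{\partial f^3} Z(f) &= -6Q_1^4 + 12 Q_1^2 Q_2 - 3Q_2^2 - 4Q_1 Q_3 + Q_4,
\end{aligned} \tag{15}$$

and so on, where

$$Q_n = \frac{1}{Q[f]}\, \frac{\partial^n Q[f]}{\partial f^n}. \tag{16}$$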

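According to Eqs. (13) and (15), the moments $\overline{R_z^m}(f)$ can be solved in terms of
derivatives of $Z(f)$ with respect to the force $f$ as follows:

$$\begin{aligned}
\overline{R_z^1} &= Z(f), \\
\overline{R_z^2} &= k_BT\, \frac{\partial}{\partial f} Z(f) + \left(\overline{R_z^1}\right)^2, \\
\overline{R_z^3} &= (k_BT)^2\, \frac{\partial^2}{\partial f^2} Z(f)
 - 2\left(\overline{R_z^1}\right)^3 + 3\,\overline{R_z^1}\;\overline{R_z^2}, \\
\overline{R_z^4} &= (k_BT)^3\, \frac{\partial^3}{\partial f^3} Z(f)
 + 6\left(\overline{R_z^1}\right)^4 - 12\left(\overline{R_z^1}\right)^2 \overline{R_z^2}
 + 3\left(\overline{R_z^2}\right)^2 + 4\,\overline{R_z^1}\;\overline{R_z^3}.
\end{aligned} \tag{17}$$

These relations show that if one has the first $(n-1)$ derivatives of $Z(f)$, this is enough
information to calculate the first $n$ moments of the distribution $P_N(R_z, f)$. The derivatives of
$Z(f)$ can be obtained by expanding the extension $Z(f)$ in a Taylor series about the reference force
$f_0$,

$$Z(f) = Z(f_0) + \frac{\partial}{\partial f} Z(f_0)\, \Delta f
+ \frac{1}{2}\frac{\partial^2}{\partial f^2} Z(f_0)\, \Delta f^2
+ \frac{1}{6}\frac{\partial^3}{\partial f^3} Z(f_0)\, \Delta f^3 + \cdots, \tag{18}$$

where

$$\Delta f = f - f_0. \tag{19}$$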

In practice, no analytical $Z(f)$ is available; only the EFCs are recorded in experiments, so all the
derivatives have to be calculated numerically.

Before beginning the next section, we summarize our procedure: first, calculate the derivatives of
$Z(f)$ from the EFCs numerically; then use Eq. (17) to obtain the necessary power moments of
$P_N(R_z, f)$; finally, apply the MEM presented in Sec. II to construct the approximate EEDDs.
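
The first two steps of this procedure can be sketched as follows. The sketch assumes that the EFC is
available as a smooth callable `Z(f)` (for example, a fit or interpolation of measured points) and
uses plain central finite differences; the function names and the step size `h` are assumptions of
this example, and noisy experimental data would generally need smoothing before differentiation.

```python
def efc_derivatives(Z, f0, h):
    """Central finite-difference estimates of Z, Z', Z'', Z''' at the force f0."""
    zm2, zm1, z0, zp1, zp2 = (Z(f0 + k * h) for k in (-2, -1, 0, 1, 2))
    d1 = (zp1 - zm1) / (2 * h)
    d2 = (zp1 - 2 * z0 + zm1) / h ** 2
    d3 = (zp2 - 2 * zp1 + 2 * zm1 - zm2) / (2 * h ** 3)
    return z0, d1, d2, d3

def moments_from_efc(z0, d1, d2, d3, kT=1.0):
    """Moments of orders 0..4 of R_z from the relations of Eq. (17)."""
    m1 = z0
    m2 = kT * d1 + m1 ** 2
    m3 = kT ** 2 * d2 - 2 * m1 ** 3 + 3 * m1 * m2
    m4 = kT ** 3 * d3 + 6 * m1 ** 4 - 12 * m1 ** 2 * m2 + 3 * m2 ** 2 + 4 * m1 * m3
    return [1.0, m1, m2, m3, m4]
```

For a linear EFC, the second and third derivatives vanish and the returned moments are those of a
Gaussian distribution with mean $Z(f_0)$ and variance $k_BT\,\partial Z/\partial f$.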



IV. TEST OF MEM: THREE FORCE STRETCHING CHAIN MODELS 



In this section, MEM is tested on three force-stretched chain models with different statistical
properties: the Gaussian chain, the freely jointed chain, and the excluded-volume (EV) chain in two
dimensions. These models are chosen mainly because their EEDDs under force are known exactly. The
EEDDs and EFCs of the models are first solved by statistical mechanical methods. Then, treating the
resulting EFCs as experimental data, approximate distributions are computed by MEM following the
procedure described in the previous section. Finally, the distributions obtained by the two methods
are compared.

For each chain model, EEDDs are calculated at three nonzero forces. In addition, distributions at
zero force are also solved. Because the EEDD without force is important in polymer research, for
example in calculating the root-mean-square end-to-end distance, it is interesting to see whether the
distributions calculated by MEM at zero force keep the main characteristics of the exact EEDD.
Although our method only solves $P_N(R_z, 0)$, the length distribution $P_N(\mathbf{R}, 0)$ could be
obtained from the numerical relations provided earlier by Domb et al.



A. Gaussian model 

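As the simplest $N$-link chain model, the Gaussian model assigns the chain under the force
$f\mathbf{z}_0$ the energy

$$E = \frac{3k_BT}{2b^2} \int_0^N dn \left(\frac{\partial \mathbf{r}}{\partial n}\right)^2
- f\mathbf{z}_0 \cdot \int_0^N dn\, \frac{\partial \mathbf{r}}{\partial n}, \tag{20}$$

where $b$ is the effective bond length and $\mathbf{r}$ is the position vector. Using the path
integral method, the EEDD of the Gaussian chain stretched by force is derived as

$$P_N(\mathbf{R}, f) = \left(\frac{3}{2\pi N b^2}\right)^{3/2}
\exp\left[-\frac{3}{2Nb^2}\left(\mathbf{R} - \frac{Nb^2 f}{3k_BT}\,\mathbf{z}_0\right)^2\right]. \tag{21}$$

Correspondingly, the distribution of the component $R_z$ along the $\mathbf{z}_0$ direction is
obtained by integration,

$$P_N(R_z, f) = \left(\frac{3}{2\pi N b^2}\right)^{1/2}
\exp\left[-\frac{3}{2Nb^2}\left(R_z - \frac{Nb^2 f}{3k_BT}\right)^2\right]. \tag{22}$$

The extension versus force is then

$$Z(f) = \int dR_z\, R_z\, P_N(R_z, f) = \frac{Nb^2 f}{3k_BT}. \tag{23}$$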



As an illustration, we choose $N = 16$ and plot the function in Fig. 1. This function will be viewed
as the EFC "measured" in experiments.

FIG. 1: EFC of the Gaussian chain for $N = 16$ (horizontal axis: force $fb/k_BT$). The three arrows
mark the forces at which the corresponding EEDDs are calculated by MEM.



Before applying MEM, we first expand Eq. (23) in a Taylor series about the force $f_0$ as

$$Z(f) = Z(f_0) + \frac{dZ}{df}\bigg|_{f_0} (f - f_0), \tag{24}$$

with $Z(f_0) = Nb^2 f_0/3k_BT$ and $dZ/df|_{f_0} = Nb^2/3k_BT$. Three moments can be obtained at any
given force $f_0$ directly through Eq. (17). Approximate distributions are then solved by MEM. The
distributions calculated by MEM and their comparison with the exact Eq. (22) at the forces 0.0, 1.0,
2.0, and $3.0\,k_BT/b$ are shown in Fig. 2. Since the extension is linear in the force, the
three-moment approximation is used in this model. Clearly, MEM precisely reconstructs the
distributions of the Gaussian chain. In fact, because the approximating function $P_2(R_z, f)$ is
itself a Gaussian distribution, it is not unexpected that MEM reconstructs the EEDDs of the Gaussian
chain perfectly. In addition, the EEDD at $f = 0.0\,k_BT/b$ has the same shape as the distributions
at nonzero forces, since the first-order derivative of $Z(f)$ is constant at any force.



FIG. 2: Comparison of the EEDDs solved by MEM (black lines) with the exact EEDDs calculated from
Eq. (22) (blue lines) for the force-stretched Gaussian chain model: (a) $f = 0.0\,k_BT/b$; (b)
$f = 1.0$, 2.0, and $3.0\,k_BT/b$. The three-moment approximation is used in MEM. The overlap of the
two sets of lines demonstrates that MEM precisely reconstructs the exact EEDDs of the Gaussian chain
at any force.



B. Freely jointed chain model



The freely jointed chain model has been used to fit the EFCs observed in force-stretching
experiments on single-stranded DNA [1]. The model is defined as a chain of $N$ links of length $b$ in
which all rotational angles occur with equal probability [15]. When an external force
$f\mathbf{z}_0$ is exerted on one end of the chain, the force potential energy is written as

$$E_f = -f\mathbf{z}_0 \cdot \sum_{n=1}^{N} \mathbf{r}_n, \tag{25}$$

where $\mathbf{r}_n$ are bond vectors with constant length $|\mathbf{r}_n| = b$. According to
Eq. (10), the distribution function $P_N(\mathbf{R}, f)$ of the end-to-end vector $\mathbf{R}$ is

$$P_N(\mathbf{R}, f) = \frac{\exp[\beta f \mathbf{z}_0 \cdot \mathbf{R}]\, P_N(\mathbf{R}, 0)}
{\int d\mathbf{R}\, \exp[\beta f \mathbf{z}_0 \cdot \mathbf{R}]\, P_N(\mathbf{R}, 0)}, \tag{26}$$

where $\beta = 1/k_BT$, and $P_N(\mathbf{R}, 0)$ is the EEDD without applied force, which has been
given in the literature [15],

$$P_N(\mathbf{R}, 0) = \frac{1}{2^{N+1} (N-2)!\, \pi b^2 R}
\sum_{n=0}^{[(N - R/b)/2]} (-1)^n \binom{N}{n} (N - 2n - R/b)^{N-2}, \tag{27}$$

where $R$ is the length of the vector $\mathbf{R}$. The normalization factor, or partition function,
$Q[f, T]$ can be calculated exactly as

$$Q[f, T] = \int d\mathbf{R}\, \exp[\beta f \mathbf{z}_0 \cdot \mathbf{R}]\, P_N(\mathbf{R}, 0)
= \left[\frac{\sinh(\beta f b)}{\beta f b}\right]^N. \tag{28}$$

The EEDD of the component $R_z$ is then obtained by integrating Eq. (26) over the components $R_x$
and $R_y$,

$$P_N(R_z, f) = \frac{\exp(\beta f R_z)\, P_N(R_z, 0)}{Q[f, T]}, \tag{29}$$

where

$$P_N(R_z, 0) = \frac{1}{2^N (N-2)!\, b} \int_{R_z/b}^{N} dx
\sum_{n=0}^{[(N - x)/2]} (-1)^n \binom{N}{n} (N - 2n - x)^{N-2}, \qquad R_z \ge 0. \tag{30}$$

The average extension $Z(f)$ is then given as

$$Z(f) = bN \left[\coth(\beta f b) - \frac{1}{\beta f b}\right]. \tag{31}$$

FIG. 3: EFC of the freely jointed chain for $N = 16$. The dot-dashed and dashed curves are the
asymptotic curves for large and small forces, respectively. The three arrows mark the nonzero forces
at which the corresponding EEDDs are calculated by MEM.



Equation (31) serves as the experimental data to test MEM. As an example, we take $N = 16$; its EFC
is shown in Fig. 3. We expand Eq. (31) in Taylor series at the forces 0.0, 0.5, 1.7, and
$4.0\,k_BT/b$. The EFC follows different asymptotic formulas in the small- and large-force regimes;
see the dashed and dot-dashed curves in Fig. 3. As for the Gaussian model, the EEDDs calculated by
MEM and their comparison with the exact distributions are shown in Fig. 4. At $f = 0.0\,k_BT/b$,
because the third-order derivative of $Z(f)$ is not zero, EEDDs are solved by MEM with three and with
five moments; see Fig. 4(a). The three- and five-moment distributions differ slightly at the origin:
the three-moment EEDD coincides with the distribution of the Gaussian model, while the five-moment
value at the origin is smaller, in agreement with the exact EEDD. These results show that MEM is
sensitive enough to detect fine differences in EEDDs from simple EFCs. At nonzero force, the EEDDs
solved by MEM coincide with the exact EEDDs.
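
The role of the third derivative at zero force can be checked by hand. The following Python sketch
works in units $b = k_BT = 1$ and compares the second and fourth moments obtained from the relations
of Eq. (17), using the small-force expansion $Z(f) = N(f/3 - f^3/45 + \cdots)$ of Eq. (31), with the
same moments computed directly from the bond statistics (each bond contributes an independent
projection distributed uniformly on $[-1, 1]$). The function names are hypothetical and exact
rational arithmetic is used only for clarity.

```python
from fractions import Fraction

def fjc_zero_force_moments_from_efc(N):
    """Second and fourth moments of R_z at f = 0 from Eq. (17), with b = k_B T = 1."""
    # Small-force expansion of Eq. (31): Z(f) = N * (f/3 - f**3/45 + ...)
    dZ = Fraction(N, 3)            # first derivative at f = 0
    d3Z = Fraction(-6 * N, 45)     # third derivative at f = 0
    m1 = Fraction(0)               # Z(0) = 0
    m2 = dZ + m1 ** 2
    m3 = Fraction(0)               # the second derivative vanishes at f = 0
    m4 = d3Z + 6 * m1 ** 4 - 12 * m1 ** 2 * m2 + 3 * m2 ** 2 + 4 * m1 * m3
    return m2, m4

def fjc_zero_force_moments_direct(N):
    """Same moments from R_z = sum of N independent projections u in [-1, 1]."""
    u2, u4 = Fraction(1, 3), Fraction(1, 5)   # <u^2> and <u^4> for uniform u
    m2 = N * u2
    m4 = N * u4 + 3 * N * (N - 1) * u2 ** 2
    return m2, m4
```

For $N = 16$ both routes give a fourth moment of 416/5, below the Gaussian value
$3(\overline{R_z^2})^2 = 256/3$. This negative excess is carried entirely by the third derivative of
$Z(f)$, which is why the three-moment approximation cannot see it.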



C. Self-avoiding chain model 



As a more realistic model, the self-avoiding chain, which accounts for EV interactions, plays a very
important role in polymer theory. In the force-stretching problem, however, almost all theoretical
models assume that the EV interaction is negligible. This assumption is questionable in the
small-force region. In this section, we model the force-stretched EV chain of $N$ links as $N$-step
self-avoiding walks (SAWs) on a two-dimensional (2D) square lattice. The early work of Domb et al.
demonstrated that the EEDD of the self-avoiding chain differs appreciably from the Gaussian
distribution. It is interesting to see whether MEM can recover the EV properties exactly when the
force tends to zero.

FIG. 4: Comparison of the EEDDs solved by MEM (black lines) with the exact EEDDs given by Eq. (29)
(blue lines) for the force-stretched freely jointed chain model: (a) $f = 0.0\,k_BT/b$. We calculate
the EEDDs using three (black dashed line) and five moments (black solid line). The five-moment EEDD
deviates slightly from the three-moment distribution at the origin, which is confirmed by the exact
EEDD. (b) $f = 0.5$, 1.7, and $4.0\,k_BT/b$. Here five moments are necessary. Unlike for the Gaussian
chain, not only do the maxima of the EEDDs move, but the distribution regions also vary with the
force. The overlap of the two sets of lines shows that MEM reconstructs the EEDDs of the freely
jointed chain very precisely at any given force.

First we formulate the force-stretching partition function $Q[f, T]$ and the EEDD $P_N(n, f)$ as

$$Q[f, T] = \sum_{n=-N}^{+N} C_N(n) \exp(f \beta n), \tag{32}$$

and

$$P_N(n, f) = \frac{C_N(n) \exp(f \beta n)}{Q[f, T]}, \tag{33}$$

where $C_N(n)$ is the number of walks whose final $x$ coordinate is $n$. The extension function $Z(f)$
can then be calculated accurately from the EEDD.
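
The enumeration behind Eqs. (32) and (33) can be sketched with a simple depth-first search. This is
a minimal illustration in Python with the lattice spacing set to $b = 1$; the function names are
hypothetical, and a plain recursive search of this kind is practical only for short chains, so a
20-step enumeration would call for a faster implementation.

```python
import math

def saw_endpoint_counts(N):
    """Count N-step self-avoiding walks on the square lattice by final x coordinate."""
    counts = {}
    visited = {(0, 0)}

    def extend(x, y, steps_left):
        if steps_left == 0:
            counts[x] = counts.get(x, 0) + 1
            return
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            site = (x + dx, y + dy)
            if site not in visited:
                visited.add(site)
                extend(site[0], site[1], steps_left - 1)
                visited.remove(site)

    extend(0, 0, N)
    return counts

def stretched_saw(counts, f, beta=1.0):
    """Return (Q, P, Z): partition function, EEDD, and mean extension at force f."""
    weights = {n: c * math.exp(beta * f * n) for n, c in counts.items()}
    Q = sum(weights.values())
    P = {n: w / Q for n, w in weights.items()}
    Z = sum(n * p for n, p in P.items())
    return Q, P, Z
```

Evaluating `stretched_saw` on a grid of forces yields the EFC that plays the role of the
"experimental" data, while the dictionary `P` is the exact EEDD against which the MEM result is
compared.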

As an illustration, we exactly enumerate all 20-step SAWs on the 2D lattice. From Eq. (33) we
calculate the EFC and plot it in Fig. 5. The numerical expansions of the extension of the chain are
calculated at the forces 0.0, 0.10, 0.25, and $0.70\,k_BT/b$. Then, using MEM, the EEDDs at these
forces are solved; see Fig. 6. At force $0.0\,k_BT/b$, the three- and five-moment distributions are
clearly different. Domb et al. pointed out that, instead of a Gaussian distribution, the
distributions with the EV effect on the 2D lattice are well fitted by a function of the form
$\exp(-|x|^{\delta})$ with an exponent $\delta$ larger than the Gaussian value of 2, so that the
portion of the EEDD near the origin is more "flat-topped" and the decay of the distribution for
larger values of $x$ is sharper. The five-moment EEDD at zero force precisely recovers these major
features. Unexpectedly, even the slight dip in the value of the distribution at the origin is
recovered by MEM; see Fig. 6(a). Because the dip arises from the restriction that the walk cannot
return to the origin [22], this result demonstrates again that MEM is sensitive enough to detect fine
structural restrictions from the simple EFC. This feature of the distribution is still preserved at
small forces, such as at $0.10\,k_BT/b$ in Fig. 6(b).

FIG. 5: EFC of the SAW chain for $N = 20$. The arrows mark the forces at which the corresponding
distributions are calculated by MEM.

From the analysis of the self-avoiding chain, we conclude that the EV effect may play an important
role even in the force-stretching problem. In particular, the EV effect is reflected more explicitly
in the EEDD than in the simple EFCs.

FIG. 6: Comparison of the EEDDs solved by MEM (lines) with the exact EEDDs (symbols) given by
Eq. (33) for the force-stretched self-avoiding chain (horizontal axes: coordinate $Z/b$). (a)
$f = 0.00\,k_BT/b$. We calculate the EEDD using three and five moments. Unlike the EEDDs of the
Gaussian chain or the freely jointed chain at $f = 0$, the five-moment EEDD (solid line) shows two
peaks in this model, as confirmed by exact enumeration (circles). (b) $f = 0.10$, 0.25, and
$0.70\,k_BT/b$. Five moments are necessary. The good agreement between the lines and symbols shows
that the EEDDs calculated by MEM recover the real distributions precisely.



V. EEDDS OF FORCE STRETCHING COMPLEX MOLECULES: HAIRPIN AND SECONDARY STRUCTURE CONFORMATIONS

From the derivation of Eq. (17), the relations are independent of the interactions between the units
in a chain. On the other hand, the units of any real polymer always interact with each other, e.g.,
the simple electrostatic repulsion of the phosphodiester backbone of DNA and the complex hydrophobic
interactions in proteins. Hence it is valuable to see what MEM can tell us about the interactions in
a molecule. Because recent single-molecule mechanical experiments have turned their attention to
molecular structure transitions induced by force, such as force unzipping of dsDNA or ssDNA (RNA), it
is natural to apply MEM to these experimental data first. In particular, the EEDDs at the critical
force are of interest. However, considering that MEM is very sensitive to the shapes of the EFCs and
that current experimental data are not fine enough, in this paper we do not apply our method to
experimental data directly. Instead, in this section we use as "experimental" data the EFCs solved by
a theoretical model of force-stretched hairpin and secondary structure conformations in the 2D plane
[12]. Because the model also provides the exact EEDDs, comparison with the EEDDs derived by MEM will
establish the validity of MEM for future applications to real data. In the following subsection, we
first give a brief overview of the statistical model of force-stretched chain molecules with hairpin
and secondary structure conformations. The details of the model are given elsewhere [12].

A. A statistical mechanical model of force stretching chains of hairpin and secondary structure conformations

Hairpin and secondary structure conformations are the basic models for the antiparallel
$\beta$-sheet in proteins and for RNA molecules. The partition function $Q_N(T; f)$ of an
$(N+1)$-monomer [$(N+1)$-mer] chain molecule stretched by the force $f$ is formulated as

$$Q_N(T; f) = \sum_E \sum_\Delta g_N(E; \Delta)\, e^{-\beta(E - f\Delta)}, \tag{34}$$

where $\Delta$ is the end-to-end distance (EED) of the chain along the force direction, and
$g_N(E; \Delta)$ is the number of conformations having energy $E$ and EED $\Delta$. Because the energy
contributed by the force depends only on the EED, we divide any conformation of the chain into two
parts: the main chain (MC), which does not involve any contacts, and the nested regions (NRs), which
form hairpins, loops, and turns. If only one NR is allowed, the conformations are called hairpin
conformations; otherwise they are called secondary structure conformations. On the 2D lattice, each
nested region contributes ±1 or 0 to the total EED, according to the direction of its outermost
contact. The quantity $g_N(E; \Delta)$ then simplifies to a product of the numbers of conformations
of the MC and the NRs, i.e., Eq. (34) can be rewritten as

$$Q_N(T; f) = \sum_n \sum_E \sum_\Delta C^{MC}(n, \Delta)\, C^{NR}(n, E)\, e^{-\beta(E - f\Delta)}, \tag{35}$$

where $n$ is the number of unrelated NRs in the conformations, and $C^{MC}(n, \Delta)$ and
$C^{NR}(n, E)$ are the numbers of conformations of the MC and the NRs, respectively. For hairpin
conformations, $n = 1$.

Because our model is restricted to the 2D lattice, the values of $C^{MC}(n, \Delta)$ at given $n$ can
be counted exactly by enumeration and extrapolation. The calculation of $C^{NR}(n, E)$ is modified
and extended from the nested polymer graph theory (NPGT) developed recently by Chen and Dill
[21, 22]. The idea behind NPGT is that the number of conformations of any arrangement of NRs is the
product of the numbers of conformations of the individual NRs, each restricted by the EV requirement.
In NPGT, different arrangements of NRs are represented by polymer graphs, the diagrammatic
representations of intrachain contacts, and each unrelated NR can be treated independently as a
polymer graph or subpolymer graph. The calculation of the number of conformations for any given
subpolymer graph is therefore the core of NPGT. According to NPGT, the number of conformations of any
subpolymer graph having $m$ subunits is a product of matrices:

$$\mathbf{U} \cdot \mathbf{S}_{t_m} \cdot \mathbf{Y}_{t_m t_{m-1}} \cdot \mathbf{S}_{t_{m-1}} \cdots
\mathbf{S}_{t_1} \cdot \mathbf{U}^{t}, \tag{36}$$

where $\mathbf{U} = \{1, 1, 1, 1\}$, $\mathbf{U}^{t}$ is the transpose of $\mathbf{U}$,
$\mathbf{S}_{t_i}$ is the structure matrix of the $i$th subunit, and $\mathbf{Y}_{t_{i+1} t_i}$ is
the viability matrix [21, 22]. We obtain the EEDD from the partition function $Q_N(T; f)$,

$$P_N(\Delta, f) = e^{\beta f \Delta} \sum_n \sum_E C^{MC}(n, \Delta)\, C^{NR}(n, E)\,
e^{-\beta E} / Q_N(T; f), \tag{37}$$


and the average extension function $Z(f)$ is calculated exactly from the above EEDDs. For comparison,
we calculate the EFCs of 70-mer homogeneous chains with hairpin and secondary structure
conformations; see Fig. 7. Here a homogeneous chain means that any contact between two monomers in
the chain contributes the energy $-\epsilon$ ($\epsilon > 0$). Considering the importance of sequence
in secondary structure molecules, we also give the EFC of a 70-mer specific sequence,
A···ACCCCCU···UC···CAAAAAG···G, where the dots represent 15-mers of A, U, C, and G, respectively; see
Fig. 9(a). In contrast to the homogeneous chain, only A-U or C-G pairs contribute the energy
$-\epsilon$.

B. Single- and multipeak distributions

As in the previous section, we compute all EEDDs by numerical expansion of the EFCs at different
forces for the different chain molecules; see Figs. 8 and 9(b).

The shapes of the EEDDs of the complex molecules are very different from those of the simple
molecules studied in Sec. IV. The most obvious feature is that the distribution regions of the
complex chains expand at intermediate forces and shrink at smaller and larger forces. This results
from the attractive interaction between monomers. Second, although the EFCs of homogeneous chains
with hairpin and secondary structure conformations look similar, differing only in how fast the
extension increases, the EEDDs calculated by MEM are completely different. Only one peak is observed
in the distribution at any force for secondary structure conformations; see Fig. 8(a). For hairpin
conformations, two peaks located at shorter and longer EEDs are present in a narrow force range
(between 0.42 and $0.49\,\epsilon/b$), with no conformations at the EEDs in between; see Fig. 8(b).
In addition, the EEDDs of the specific sequence show more complex shapes. At $f = 0.39\,\epsilon/b$
the distribution has two peaks; they quickly fuse into one peak with a small force increase of
$0.03\,\epsilon/b$, and finally the peak separates into two peaks again at $f = 0.45\,\epsilon/b$. To
explore the single- or multipeak phenomena, comparison with the exact EEDDs is essential. These
results are plotted in the corresponding figures. We find that the two peaks in the distributions
predicted by MEM are consistent with the exact EEDDs of the specific sequence and of the homogeneous
hairpin chain. At force $0.42\,\epsilon/b$, however, the exact distribution of the specific sequence
shows three peaks, whereas MEM predicts only one.

FIG. 7: EFCs of 70-mer homogeneous chains with hairpin (blue line) and secondary structure
conformations (black line). The temperature is $0.28\,\epsilon/k_B$. The six arrows mark the forces
at which the corresponding distributions of the two conformations are calculated by MEM.

FIG. 8: Comparison between the EEDDs solved by MEM (lines) and the exact EEDDs given by Eq. (37)
(symbols) for homogeneous chains. (a) Secondary structure conformations, where $f = 0.20$, 0.50, and
$0.70\,\epsilon/b$. (b) Hairpin conformations, where $f = 0.30$, 0.46, and $0.60\,\epsilon/b$. Five
moments are necessary. Two independent peaks appear in the EEDD of the hairpin model at force
$0.46\,\epsilon/b$ (triangles and dashed line), whereas they are absent for secondary structure
conformations.

According to Eq. (17), the first-order derivative of the average extension $\overline{R_z^1}(f)$ with
respect to the force $f$ can also be written as
$d\overline{R_z^1}/df = [\overline{R_z^2} - (\overline{R_z^1})^2]/k_BT$. This formula has the same
form as the definition of the heat capacity $C(T)$, except that the energy and temperature are
replaced by the EED and the force, respectively. We believe that the EED plays an important role in
the problem of force-stretched chains, very similar to the role played by the energy in the thermal
melting of biomolecules, at least in nucleic acids. Many useful insights can be gained through this
analogy. Since the energy distribution can reveal molecular structure transitions induced by heating,
the EEDD might reveal structure transitions driven by force. For example, the EEDDs in Fig. 8 show
that the transitions in secondary structure and hairpin conformations are "one-state" and
"two-state", respectively. These terms are borrowed from the thermal melting case. The difference
between the transitions in the two conformations warns us that the simpler EFCs may conceal critical
physical information; investigating the EEDDs is necessary to determine the physical properties of
force "melting" chain molecules.

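The fluctuation relation between the slope of the EFC and the variance of the EED can be checked
numerically on any discrete model of the form of Eq. (34). The following Python sketch uses a purely
hypothetical two-state toy density of states (one compact low-energy state and many extended
zero-energy states, with numbers chosen only for illustration) and compares a finite-difference slope
of $Z(f)$ with the variance of the EED divided by $k_BT$.

```python
import math

def eed_statistics(g, f, kT=1.0):
    """Mean and variance of the EED for weights g[(E, d)] * exp(-(E - f*d)/kT)."""
    w = {key: n * math.exp(-(key[0] - f * key[1]) / kT) for key, n in g.items()}
    Q = sum(w.values())
    mean = sum(key[1] * x for key, x in w.items()) / Q
    second = sum(key[1] ** 2 * x for key, x in w.items()) / Q
    return mean, second - mean ** 2

# Hypothetical toy model (not taken from the paper): keys are (energy, EED),
# values are numbers of conformations.
toy_g = {(-10.0, 1): 1, (0.0, 20): 50}

f, h = 0.3, 1e-4
slope = (eed_statistics(toy_g, f + h)[0] - eed_statistics(toy_g, f - h)[0]) / (2 * h)
variance = eed_statistics(toy_g, f)[1]
print(slope, variance)   # the two numbers agree when kT = 1
```

Near the force at which the two states are equally populated, the variance (and hence the slope of
the EFC) is large although the EFC itself is a smooth sigmoid; the bimodal character of the
transition is visible only in the distribution.
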
In traditional theory, physical properties of polymers, such as cooperativity or the type of melting
transition, depend only on the number of monomers [19]. In biomolecules, however, the monomer
sequence may affect the physical results dramatically. A clear case in the EEDDs is the number of
peaks for the specific sequence in Fig. 9, even though its EFC is simple. The case of the specific
sequence warns us again that EFCs may be too simple to yield real and useful information about the
studied molecules. Because five moments cannot reconstruct three peaks in a distribution, MEM shows
only a rapid expansion of the distribution region. To resolve three or more peaks in EEDDs, higher
moments are necessary. Although MEM fails to predict the three peaks, the abnormal expansion of the
distribution observed by MEM between two forces that each give two peaks can still be seen as a sign
of the appearance of multiple peaks.



FIG. 9: (a) EFC of the 70-mer specific sequence with secondary structure conformations (horizontal
axis: force $f$ in units of $\epsilon/b$), where the temperature is $0.28\,\epsilon/k_B$. The three
arrows mark the forces at which the corresponding distributions are calculated by MEM. (b) Comparison
of the EEDDs solved by MEM (lines) with the exact EEDDs (symbols) (horizontal axis: coordinate
$Z/b$), where $f = 0.39$, 0.42, and $0.45\,\epsilon/b$. Five moments are necessary. Unlike the EEDDs
of homogeneous chains, three peaks appear in the exact EEDD at $f = 0.42\,\epsilon/b$ (triangles),
while the EEDD calculated by MEM only shows an abnormal expansion at this force (dashed line).



VI. CONCLUSIONS 



In this paper, in contrast to the traditional approach, we calculate the end-to-end distance
distributions of force-stretched chain molecules from the measured EFCs by using MEM. Because the
method is independent of the polymer energy formula and of structural details, it provides a useful
and simple way to extract real physical information about complex molecules. Many results, such as
the important role played by EV interactions and the single- or multipeak structure of EEDDs, can be
obtained from the simple EFCs. It will be interesting to see whether these results can be found in
real stretching experiments.